Discovering Conditions for Intermediate Reinforcement with Causal Models
نویسنده
چکیده
Learning to perform a task in an environment with sparse feedback is a difficult problem. While several approaches for increasing feedback during learning have been taken, these methods suffer from the dependency on human knowledge and engineering to find good solutions. We propose using causal models to increase the amount of feedback that will improve learning. This approach does not require domain-specific human engineering because causal models can be constructed directly from the environment using empirical data. Preliminary experiments and results show causal models can be used to automatically discover conditions for applying intermediate feedback that accelerate learning.
منابع مشابه
Discovering Causal Models of Self-Regulated Learning
New statistical methods allow discovery of causal models from observational data in some circumstances. These models permit both probabilistic inference and causal inference for models of reasonable size. Many domains, such as education, can benefit from such methods. Educational research does not easily lend itself to experimental investigation. Research in laboratories is artificial and poten...
متن کاملDiscovering temporally extended features for reinforcement learning in domains with delayed causalities
Discovering temporally delayed causalities from data raises notoriously hard problems in reinforcement learning. In this paper we define a space of temporally extended features, designed to capture such causal structures, using a generating operation. Our discovery algorithm PULSE exploits the generating operation to efficiently discover a sparse subset of features. We provide convergence guara...
متن کاملBehavior systems and reinforcement: an integrative approach.
Most traditional conceptions of reinforcement are based on a simple causal model in which responding is strengthened by the presentation of a reinforcer. I argue that reinforcement is better viewed as the outcome of constraint of a functioning causal system comprised of multiple interrelated causal sequences, complex linkages between causes and effects, and a set of initial conditions. Using a ...
متن کاملProbabilistic Computational Causal Discovery for Systems Biology
Discovering the causal mechanisms of biological systems is necessary to design new drugs and therapies. Computational Causal Discovery (CD) is a field that offers the potential to discover causal relations and causal models under certain conditions with a limited set of interventions / manipulations. This chapter reviews the basic concepts and principles of CD, the nature of the assumptions to ...
متن کاملBeam & Shell Models for Composite Straight or Curved Bridge Decks with Intermediate Diaphragms & Assessment of Design Specifications
In this research effort, the generalized warping and distortional problem of straight or horizontally curved composite beams of arbitrary cross section, loading and boundary conditions is presented. An inclined plane of curvature is considered. Additionally, the stiffness of diaphragmatic plates has been introduced in the formulation in order to compare with the case where rigid diaphragms are ...
متن کامل